Searching Social Updates for Topic-centric Entities
نویسندگان
چکیده
With the growing popularity of social networking services, real time short messages, such as Facebook news feeds and Twitter tweets, are becoming increasingly important information sources. People use these services to search for and consume content about interesting topics and events. Given a keyword search for a certain topic, simply returning those messages often does not give a comprehensive summary of the topic, primarily due to the brevity and redundancy of the messages. To address this challenge, we propose a topic centric entity extraction system where interesting entities pertaining to a topic are mined and extracted from short messages returned as search results on the topic. Specifically, we leverage signals from three main aspects: message content, social connections (i.e., message sender’s follower network), and referenced Web pages (i.e., URLs embedded within the messages), and propose: 1) page ranking algorithms for identifying relevant pages embedded within the messages; and 2) entity ranking algorithms for identifying relevant entities extracted from those URLs. Comprehensive experiments using real Twitter data show that our ranking algorithms are efficient and outperform baseline algorithms significantly in terms of extraction quality.
منابع مشابه
User Activity Analytics on the Social Web of News
The proliferation of social media is undoubtedly changing the way people produce and consume news online. Editors and publishers in newsrooms need to understand user engagement and audience sentiment evolution on various news topics. News consumers want to explore public reaction on articles relevant to a topic and refine their exploration via related entities, topics, articles and tweets. I wi...
متن کاملB-hist: Entity-centric search over personal web browsing history
Web Search is increasingly entity-centric; as many common queries target specific entities, search results are progressively augmented with semi-structured and multimedia information about entities. However, search over personal Web browsing history still revolves around keyword-search mostly. B-hist aims at providing Web users with an effective tool for searching and accessing information prev...
متن کاملMulti-aspect Entity-Centric Analysis of Big Social Media Archives
Social media archives serve as important historical information sources, and thus meaningful analysis and exploration methods are of immense value for historians, sociologists and other interested parties. In this paper, we propose an entity-centric approach to analyze social media archives and we define measures that allow studying how entities are reflected in social media in different time p...
متن کاملHuman-Centric Decision-Making Models for Social Sciences
It's not surprisingly when entering this site to get the book. One of the popular books now is the human centric decision making models for social sciences. You may be confused because you can't find the book in the book store around your city. Commonly, the popular book will be sold quickly. And when you have found the store to buy the book, it will be so hurt when you run out of it. This is w...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کامل